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Abstract: A typical bottleneck of model predictive control algorithms is the computational 
burden in order to compute the receding horizon feedback law which is predominantly deter- 
mined by the length of the prediction horizon. Based on a relaxed Lyapunov inequality we 
present techniques which allow us to show stability and suboptimality estimates for a reduced 
prediction horizon. In particular, the known structural properties of suboptimality estimates 
based on a controllability condition are used to cut the gap between theoretic stability results 
and numerical and practical observations. 



1 Introduction 

Model predictive control (MPC), also termed receding horizon control, is a well established 
method for the optimal control of linear and nonlinear systems [5 ,22 and also widely used in 



industrial applications, cf. |3 ,14 . The method generates a sequence of finite horizon optimal 
control problems in order to approximate the solution of an infinite horizon optimal control 
problem, the latter being, in general, computationally intractable. For each resulting sequence 
of control values only the first element is implemented at the corresponding time step which 
renders this method to be iteratively applicable and generates a closed loop static state feedback. 
Due to the replacement of the original infinite by a sequence of finite control problems, stability 
of the resulting closed loop may be lost. To ensure stability, several modifications of the 



finite horizon control problems such as stabilizing terminal constraints 17 or a local Lyapunov 
function [T] as an additional terminal weight have been proposed. 

Here, we focus on the computationally attractive plain form of MPC without the mentioned 
stabilizing modifications. For such a control loop stability and suboptimality has been shown 



in via a relaxed Lyapunov inequality, see also 20,23 for the linear case. However, a 
controllability assumption or knowledge on the optimal value function for the finite horizon 
subproblem is required in order to deduce concise bounds on the required prediction horizon 
length which is a restrictive condition especially for nonlinear systems. Furthermore, stability is 
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ensured for a set of initial values. Indeed, the derived bounds may be rather conservative since 
stable behavior of the MPC controlled closed loop can be observed in many examples although 
stability can be guaranteed by means of these theoretical results for a longer prediction horizon 



only, cf. 24 



Our goal consists of designing an MPC algorithm which ensures desired stability properties 
at runtime for a given initial condition for significantly smaller prediciton horizons. From a 
practical point of view reducing the prediction horizon and, thus, the computational compu- 
tational burden of the MPC algorithm is important. To this end, we also employ a relaxed 
Lyapunov inequality but do not aim at verifying it at each sampling instant separately. Here, 



our starting point is an observation from 13 : implementing more than only the first element 



of the computed open loop sequence of control values and checking the relaxed Lyapunov in- 
equality at the next update time enhances the suboptimality bound from 11 for a large class of 



control systems. But, since doing so may be harmful in terms of robustness, cf. 19 , additional 
conditions on the structure of consecutive MPC problems are presented which guarantee the 
same relaxed Lyapunov and, thus, stability to be maintained if the control loop is closed more 
often. 

Additionally, we deduce stability and performance estimates which allow us to violate the 
relaxed Lyapunov inequality temporarily, much alike the watchdog technique used in nonlinear 
optimization [g]. To this end, we use an idea similar to [S] and monitor the progress made in 
the decrease of the value function along the closed loop. 

The paper is organized as follows: In the following section the considered MPC problem and 
a trajectory based stability theorem from |12j are presented. Then, Section [s] generalizes this 
result to an MPC setting which allows for implementing longer parts of the sequence of control 
values computed at an update time instant. Furthermore, a first algorithm utilizing this weaker 
condition is derived. To carry over the improvements of the suboptimality bound to standard 
MPC an algorithmic approach is presented in Section |4j The issue of an exit strategy is 
addressed in Section |5] which allows us to temporarily violate the relaxed Lyapunov inequality 
and to further improve previous suboptimality estimates. Last, conclusions are drawn in Section 
[6] Throughout the work a simple example repeatedly provides insight into the improvements 
of the proposed results. 



2 Problem Setup and Preliminaries 

Let No denote the natural numbers including zero and M^j" the nonnegative real numbers. A 
continuous function 7 : Mq — )■ Mq is said to be of class /Coo if it satisfies 7(0) = 0, is strictly 
increasing and unbounded. Furthermore, a continuous function /3 : x Mj" — )■ is of class 
KC if it is strictly decreasing in its second argument with limt_s.oo /3(r, t) = for each r > and 
satisfies (3{-,t) G /Coo for each t>0. 

Let X and U be arbitrary metric spaces equipped with the metrics dx '■ X x X M.q and 
du : U X U ^ M.Q. In this work nonlinear discrete time control systems of the form 

x{n + 1) = f{x{n),u{n)), x{0) = Xq (1) 

for n G No are considered. x{n) G X C X and u{n) G U C f/ represent the state and control of 
the system at time instant n. Hence, X and U stand for the state and the control value space, 
respectively. Since X and U are arbitrary metric spaces, the following results are applicable 
to discrete time dynamics induced by a sampled - finite or infinite dimensional - systems, see, 
e.g., [2,15 . Here, using subsets X and U allows for incorporating constraints. Throughout this 



work, the space of control sequences u : No — )■ U is denoted by U^". 
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For control system ([T]) we want to construct a static state feedback u = iJi{x) which asymptot- 
ically stabilizes the system at a desired equilibrium x* via model predictive control. The 
trajectory generated by the feedback map /x : X — )■ U is denoted by x^{n) = x^{n;xo), 
n = 0, 1, . . .. In order to define asymptotic stability rigorously, the open ball with center x 
and radius r is denoted by Br{x) and the distance from x to the equilibrium x* is represented 
by llxll^;* := dx{x,x*). 

Definition 2.1 

Let a static state feedback : X — )■ U satisfying f(x*,ix{x*)) = x* for system ([T]) be given, i.e., 
X* is an equilibrium for the closed loop. Then, x* is said to be locally asymptotically stable if 
there exist r > and a function /?(■, ■) G /C£ such that, for each xo G Br{x*) fl X, 

x^{n]Xo)eX and \\x^{n;xo)\\x* < f3{\\xo\\:r*,n) Vn G Nq. (2) 

W.l.o.g. existence of a control value m* G U such that f{x*, u*) = x* holds is assumed through- 
out this work. The quality of the feedback is evaluated in terms of the infinite horizon cost 
functional 

oo 

Joo{xq,u) = ^i{x{n),u{n)) (3) 

n=0 

with continuous stage cost i : X x U satisfying i{x*,u*) = and i{x,u) > for 

all M G f/ for each x ^ x*. The optimal value function corresponding to (|3| is denoted by 
Voo{xo) = inf„guf<o Joo{xo,u). Now, utilizing Bellman's principle of optimality for the optimal 
value function V"oo(-) yields the Lyapunov equation 

V^ixo) = mf{i{xo,u) + V^{f{xo,u))}. (4) 

In order to avoid technical difficulties, let the infimum of this equality with respect to m G U be 
attained, i.e., there exists a control u*^ G U such that Voo{xo) = £{xo, u*^) + Voo{f {xq, u*^)) . u*^ 
is called the argmin of the right hand side of Q — albeit uniqueness of the minimizer u*^ is 
not required. In case of uniqueness the argmin-operator can be understood as an assignment, 
otherwise it is just a convenient way of writing "u*^ minimizes the right hand side of (|4])". 
Hence, an optimal feedback law on the infinite horizon is defined via 

fiooix{n)) := argmin {i{x{n), u) + Voc{f{x{n),u))} = argmin {£{x{n), u) + Voo{x{n + 1))} . 

(5) 

In general, the computation of the control law ([s]) requires the solution of a Hamilton-Jacobi- 
Bellman equation which is, in general, very hard to solve. In order to avoid this burden, a model 
predictive control approach is used which approximates the infinite horizon optimal control. 
MFC consists of three distinct steps: First, given measurements of the current state xq, an 
optimal sequence of control values over a truncated and, thus, finite horizon is computed which 
minimizes the cost functional 

Jn{xo,u) = y^J{xuik;xo),u{k)). 

Here, Xu{-',Xo) denotes the trajectory emanating from xq corresponding to the input signal 
m(-). As a result, we obtain the open loop control un{-',xo) = argmin^g^^ Jn{xo,u) and the 
corresponding open loop state trajectory 

Xut,{n + 1;xq) = f{xuj^{n;xo),UN{n;xo)) (6) 
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for = 0, . . . , — 1 with initial value (0; Xq) = Xq. In a second step, the first element of the 
open loop control is employed in order to define the feedback law fiN{xo) := un{0;xo)- Last, 
the feedback is applied to the system under control revealing the closed loop system 

x^^{n + l;xo) = /(x^^(n;xo),/i7v(x/,^(n;xo))) (7) 

for which the abbreviation Xfj_^{-) will be used whenever the initial value xq is clear. The closed 
loop costs with respect to the feedback law at (•) are given by V^'^(xo) = ■^(•^/^iv(^)' A^a^(^mjv' 

and Vp^{xo) = inf^g^^ Jn{xo,u) denotes the optimal value function corresponding to Jn{xo, ■). 
In the literature endpoint constraints or a Lyapunov function type endpoint weight are used 
to ensure stability of the closed loop, see, e.g., [7|[9|[T6}[T7j. Here, we focus on MPC version 
without these modifications. Hence, feasibility of the MPC scheme is an issue that cannot be 
neglected. In order to exclude a scenario in which the closed loop trajectory runs into a dead 



end, the following viability condition is assumed, see also 12, Chapter 8] for generalizations 



Assumption 2.2 

For each a; G X there exists a control value Ux &U satisfying f{x, Ux) € X. 



According to 12, Proposition 7.6], stable behavior of the closed loop trajectory can be guaran- 



teed for such a controller using relaxed dynamic programming, cf. 18 



Theorem 2.3 

(i) Consider the feedback law hn : X — )• U and the closed loop trajectory x^^{-) of ^ with 
initial value Xq G X. // the optimal value function Vn : X — )■ satisfies 

V)v(x^j^(n)) > V)v(x^^(n + 1)) + a£(a;^^(n),/XAr(x^j^(n))) Vra G Nq (8) 

for some a G (0, 1], then the performance estimate 

aVUx.M) < aV^-{x,M) < Vn{x,M) < V^x^M) (9) 
holds for all n G Nq- 

(a) If, in addition, there exist ai, 0:2, 0:3 G /Coo such that a;i(||x||3;*) < Vn{x) < a;2(||x||a;*) and 
i{x,u) > a3{\\x\\x*) hold for x = x^j^iji) G X, n G No, then there exists (5 G /C£ which only 
depends on 01,0:2, as and a such that ^ holds, i.e., x^^{-) behaves like a trajectory of an 
asymptotically stable system. 



Within Theorem 2.3 the relaxed Lyapunov Inequality (|8j) is the key assumption. From the 
literature, it is well-known that this condition is satisfied for sufficiently long horizons A, 
cf. [1 , 10, 16 , and that a suitable A may be computed via methods described in [8, 12 . The 



suboptimality index a in this inequality can be interpreted as a lower bound for the rate of 
convergence. Furthermore, Inequality ^ yields a performance bound for the MPC closed loop 
in comparison to the optimal costs on the infinite horizon. 

3 Shortening the prediction horizon by weakening the 
relaxed Lyapunov inequality 

For the described MPC setting we want to guarantee stability and a certain lower bound on 
the degree of suboptimality with respect to the infinite horizon optimal control law Hoo{')- 
Yet, at the same time the optimization horizon A shall be as small as possible. These goals 
oppose each other since it is known from the literature, see, e.g., [T , 10, 16 , that stability can 
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only be guaranteed if the optimization horizon is sufficiently long. Keeping small, however, 
is important from a practical point of view since the horizon length is the dominating factor 
regarding the computational burden. Here, motivated by theoretical results deduced in [Ts], 



our aim consists in improving stability and suboptimality bounds from 12, Proposition 7.6] 
and, thereby, allowing for smaller optimization horizons. 

Note that Condition (|8| is a sufficient, yet not a necessary condition. In particular, even if 
stability and the desired performance Estimate ^ cannot be guaranteed via Theorem 2.3 



stable and satisfactory behavior of the closed loop may still be observed, even in the linear 
case, as illustrated by the following example: 

Example 3.1 

Let X = X = ]R^,U = [/ = ]Rbe given and consider the linear control system x{n + 1) = 
Ax{n) + Bu{n) with quadratic stage cost function i{x,u) := Qx + u^Ru from 23 with 



A 



1 1.1 
-1.1 1 



B 



Q 



1 
1 



For the given system, the optimal finite horizon costs VAr(a;o) = 
puted via the Ricatti difference equation using the initial condition Pi = Q according to 



and R= {I). 



I PnXq may be can be com- 



P. 



A' 



p,-PjB{b^p,b + r)-'b^p, 



A + Q. 



Then, the resulting open loop control law un{-,x) is given by 



UN{k; x) 



{B^PN-kB + R) 



B'^Pn-uAx. 



{k;x) 



k = 0,1, . . . , N — 2, and un{N — 1;x) = 0, cf. |20j. Now, we implement the Ricatti feedback 
in the usual receding horizon fashion fiiy{x{n)) := U]\[{0; x{n)) with = 3 and evaluate the 
suboptimality bound from Theorem 2.3 along the closed loop. As displayed in Figure [T]^ left) a 
typical trajectory tends towards the origin quickly. However, Figure [Tj^right) shows that for the 
set of initial values X := {(cos(27rfc/fcmax), sin(27rA;/fcmax)) I ^ = 1, 2, . . . , fcmax} with k^aax = 
there exists a nonempty subset of initial values for which we obtain a < in ([s]). Hence, 
stability cannot be deduced from Theorem 2^ Here, we like to mention that stability can be 
guaranteed for all initial values Xq G A" by means of Theorem 2.3 if we choose A^ = 4. Note 
that in [20| Section 6] it has been shown in a similar manner that the closed loop is stable if 
A^ > 5. 



Motivated by the fact that the computational effort grows rapidly with respect to the prediction 
horizon A^, it is not desireable to choose A^ larger than necessary to guarantee the relaxed 
Lyapunov inequality ([8| to hold. Our goal in this work is to develop an algorithm which allows 



us to check whether stability and performance in the sense of Theorem 2.3 can be guaranteed 
for a prediction horizon which is smaller than the prediction horizon required for ([s]) to hold. 
In order to further motivate this approach, we consider Example 3.1 again. 



Example 3.2 

Implement the ffist two MFC steps and consider the quantity 



V)v(xo) - VAf(a;^^(2;xo)) 
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Figure 1: Left: Illustration of the used set of initial values (grey) together with a typical closed 
loop solution (black). Right: Illustration of the subset of initial values (black) of X (grey) for 
which a > in (Isl) does not hold. 



For each xq E X we have that aN,2{xQ) > holds. Hence, defining a :- 
yields 



infa;QeA'aAr,2(a;o) > 



1 



Vn{xo) > Vn{x^^{2]Xo)) + a ( ^ ^(x^^(n; Xq), /i7v(x^^(n; Xq))) | , (10) 

\n=0 

i.e., the relaxed Lyapunov inequality with a positive suboptimality index a after two steps. 
Indeed, this conclusion holds true for all a; G X = X which can be shown by using the MFC 



feedback computed in Example 3.1 



In particular. Example 3. 2| shows that a generalized type of relaxed Lyapunov inequality similar 
to ( 10 ) may hold after implementing several controls despite the fact that the central assumption 
([s]) of Theorem 2.3 is violated. Checking the relaxed Lyapunov inequality ^ less often may 
help in order to ensure desired stability properties of the resulting closed loop. Note that in 



a more complicated setting than the one considered in Example |3.2| it may not be possible 
to guarantee such an inequality for a set of initial conditions. Hence, implementing an MFC 
feedback law although ([s]) is violated may lead to severe stability problems. Here, we aim 
at designing a strategy which ensures - a priori and at runtime of the corresponding MFC 
algorithm - that a relaxed Lyapunov inequality is fulfilled after m G {1, 2, . . . , iV — 1} steps. 
Our first attempt is motivated by an observation from fT3| Section 7] . In this reference estimates 
for the suboptimality degree a for a set of initial conditions are deduced and the following fact 
has been proven: if more than one element of the computed sequence of control values is applied, 
then the suboptimality estimate a is increasing (up to a certain point). To incorporate this 
idea into our MFC scheme, we first need to extend our notation: 

The list S = (cr(0), cr(l), . . .) C No is introduced, which is assumed to be in ascending order, 
in order to indicate time instants at which the control sequence is updated. The closed loop 
solution at time instant cr^n) is denoted by x„ = x^^(o"(n)). Furthermore, the abbreviation 
m„ := a{n + 1) — cr{n), i.e., the time between two MFC updates, is used. Hence, 

(11) 

to define the closed loop 



holds. This enables us 
control 



^/^]v('^(^) + ^n) = Xf,^{a{n + 1)) = Xn+l 

in view of Bellman's principle of optimality 



Xr. 



argmm 



V; 



N—mn\Xu\1^n: X^ 



i)) + ^ i{xu{k;xn),u{k)) 



(12) 



k=0 
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Describing the fact shown in \13j Section 7] more precisely, a lower bound on the degree of 
suboptimality aAr,m„ relative to the horizon length and the number of controls to be imple- 
mented rUn can be obtained. This bound allows for measuring the tradeoff between the infinite 
horizon cost induced by the MPC feedback law fi%{-', ■) similar to Theorem 2.3, i.e. 



s / 



(13) 



n=0 fc=0 

and the infinite horizon optimal value function Voo{xo) evaluated at Xq. We point out that the 



results shown in 13 ensure stability for a set of initial values. Hence, using a prediction horizon 



corresponding to the approach proposed in this reference leads to a performance estimate which 
may be conservative - at least in parts of the state space. Here, we extend ([8| to an m-step 



suboptimality estimate which is similar to 13 but can be applied in a trajectory based setting. 



Proposition 3.3 

Let a list S = {cT{n))n£no C Nq such that rrin = a{n + 1) — a{n) G {1, 2, . . . , — 1} holds he 
given. Consider the open loop system a;„j^(-; ■) of the feedback law ■), the closed loop 
trajectory Xn, n G No, of (11) with initial value xq G X and a fixed a G (0, 1]. // there exists a 
function Vat : X — )• Mq satisfying 



m.n-1 



VN{xn) >VN{xn+i) + a ^ (fc; a;„) , Mat (fc; Xn) ) 



(14) 



k=0 



with rrin G {1, . . . , — 1} for all n G No, then 



«V;o(xo) < aK^^(xo) < Vn{xq) < Voo(xo) 



(15) 



holds for ■) given by (12) for all n G Nq 



Proof Reordering ([Ti]), we obtain a^'^^Q'^ £{xur^{k; Xn),UN{k; Xn)) < VN{xn) - Vjv(a;n+i)- 
Summing over all the first n G No time instants yields 



n mi — I 



a 



£{xut,{k;Xi),UN{k;Xi)) < Vn{xo) - Viv(a;„+i) < Vn{xo). 



i=0 k=0 



Hence, by definition of /ifr(-; ■) in (12) and K»^(-) in (13), taking n to infinity implies the second 



inequality in (15). The first and the last inequality in (15) follow by the principle of optimality 
which concludes the proof. □ 

An implementation which aims at guaranteeing a fixed lower bound of the degree of subopti- 



mality a in the sense of Proposition 3.3 may take the following form: 



Algorithm 3.4 

Given state Xq := x, n = 0, list S = (n), N ^ N>2 and a G (0, 1] 
(I) Set j := and compute MAr(-;x„) and V]\f{x). Do 
(a) Set j := j + 1, compute VNiXu^{j] a;„)) 



(b) Compute a = {a \ a maximally satisfies (14) with m„ := j} 
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(c) If a > a: Set m„ 


:= j and goto (II) 




(d) If J = - 1: Set 


1 * J 'lit 1 J /tt\ 

m„ accordmg to exit strategy and goto (Ilj 




while a <a 




(11) 


For j = 1, . . . , m„ do 






Implement fi%{j - 


- l;x„) := MAr(j - l;Xn) 


(III) 


Set 5 := (5,back(5) + 


m„), := x^s (m„; x„), n := n + 1 and goto (I) 



Here, we adopted the programming notation hack which allows for fast access to the last element 
of a list. Note that S is built up during runtime of the algorithm and not known in advance. 
Hence, S is always ordered. 

Remark 3.5 



// (14) is not satisfied for j < N — l, an exit strategy has to be used since the performance bound 
a cannot be guaranteed. In order to cope with this issue, there exist remedies, e.g., one may 
increase the prediction horizon and repeat Step (I). For sufficiently large N , this ensures the local 



validity of (14). Unfortunately, the proof of Proposition 3.3 cannot be applied in this context 



due to the prolongation of the horizon. Yet, it can be replaced by estimates from 18, 121 for 



varying prediction horizons to obtain a result similar to (15). Alternatively, one may continue 
with the algorithm. If the exit strategy doesnot have to be called again, the algorithm guarantees 
the desired performance for Xn instead of Xq, i.e., from that point on. 



Utilizing Algorithm |3.4| the following result shows asymptotically stable behavior of the com- 
puted state trajectory: 

Theorem 3.6 

Suppose a control system ([T| with initial value xq G X and a G (0, 1] to be given and apply 
Algorithm 3.4 Assume that for each iterate n G No condition a>a in Step (Ic) of Algorithm 
3.4 is satisfied for some j G {1, . . . , A^ — 1}. Then, the closed loop trajectory corresponding to the 



closed loop control ■) resulting form Algorithm 3.4 satisfies the performance estimate (15) 



If, in addition, the conditions of Theorem 2.3(ii) hold, then x 
an asymptotically stable system. 



behaves like a trajectory of 



Proof. The algorithm constructs the set S. Since Step (Ic) is satisfied for some j G {1, . 
1} the assumptions of Proposition 



3.3 



i.e.. Inequality (14) for x. 



N- 

are satisfied. 

Hence, by Proposition 3.3, the performance Estimate (15) follows for the control sequence 
•)• Last, similar to the proof of Theorem 2.3 given in Theorem 7.6], standard direct 
Lyapunov techniques can be applied to conclude asymptotically stable behavior of the closed 
loop trajectory x^s (■). □ 

Example 3.7 

Consider Example 3J^ in the context of Theorem 3^ Recall that using the Riccati based open 
loop control law UN{-'iX{n)) in the standard MPC fashion ^N{x{n)) := un{0; x{n)) for horizon 
length A^ = 3 together with Theorem 2.3, there exist initial values for which a > cannot be 



guaranteed. Now, if we use Algorithm 3.4 instead, i.e. we allow for m„ > 1, then we obtain 
a > a = for each initial value Xq G X. In Figure [2] we illustrated the impact of changing m„ 
on the difference V/v(x„) — V/v(a^n+i) for N = 3. In particular. Figure |2|^left) shows that there 
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Figure 2: Left: Difference Vs{xn) — Vs{xn+i) with m„ = 1. Right: Difference Vs^Xn) — V3(a;„-t-i) 
for with m„ = 2. 



exists regions in the state space where the value function is increasing whereas V3(-) in Figure 
[2|^right) is always decreasing, i.e. all trajectories converge towards the origin. 
Using ai(||a;||) = a3(||a;||) = ||x|| = mmu<^u i{x,u) < Vn{x) = x'^P^x < A(P)||a;|| = a2(||a;||) 
where A denotes the largest Eigenvalue of the matrix Pjy ensures the assumptions of Theorem 
|2.3[ii). Hence, applying Theorem 3.6 enables us to conclude asymptotic stability of the closed 



loop generated by Algorithm 3.4 



Example |3.7| indicates that checking the relaxed Lyapunov inequality less often may allow us 
to maintain stability for a reduced prediction horizon. We point out that in Example 3.7 



condition (14) has been ensured in advance, i.e., before implementing a control input at the 
plant. Algorithm 3^ may vary the number of open loop control values to be implemented 
during runtime at each MFC iteration. In particular, the system may stay in open loop for 



more than one sampling period in order to guarantee ( 14 ) to hold. Such a procedure may lead 
to severe problems in terms of robustness, cf. |4T9 . Hence, from a practical point of view it is 
preferable to close the control loop as often as possible, i.e., m„ = 1 for all n G Nq. Furthermore, 
for many applications stable behavior of the closed loop is observed for m„ = 1 even if stability 
cannot be guaranteed via (Isj), cf. Example 3.2 In the following, we deduce conditions which 



allow the control loop to be closed more often compared to Algorithm 3.4 while maintaining 
stability. 



4 Robustification by closing the loop more often 



If rUn > 1 is required in Step (I) of the proposed algorithm in order to ensure (14), the following 



methodology which has been proposed in 21 , Section 4] allows us to prove the following: Given 



a certain condition, the degree of suboptimality a in ( 14 ) is maintained for the updated control 

UN{k;Xn), 



UN{k;Xn) :-- 



k<3 

UNik - j;xuj^ij;xn)), k>j 



(16) 



with j G {1, . . . , m„ — 1} and where a{n) + j is added to the hst S. The theoretical foundation 
of such a method is given by the following result: 

Proposition 4.1 

Let the open loop system Xu^^i-; ■) ^/ © uiith initial value Xn be given and inequality (14) 
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hold for un{-] a e (0, 1], and rrin G {2, . . . , — 1}. // the inequality 



Vn( (j; Xn)))- VN-j{Xuj,U'^ ^n)) (17) 

{j;Xn)),UN{k 

k=0 k=j 



holds for some j G {1, . . . , m„ — 1}, then the control sequence un{-', Xn) can be replaced by (16) 
and the lower bound on the degree of suboptimality a is locally maintained. 



Proof. In order to show the assertion, we need to show ( 14 ) for the modified control sequence 



(16). Reformulating (17) by shifting the running costs associated with the unchanged control 



to the left hand side of (17) we obtain 



m„ — l 



{j;xn))) - Vn{ (0;x„)) < -a ^ ^{xuj,{k] 



k=0 



which is equivalent to the relaxed Lyapunov inequality (14) for the updated control UN{-;Xn)- 

□ 



Utilizing Proposition 4.1 in Algorithm 3.4 we see that only Steps (II) and (III) need to be 
changed and may take the following form: 



Algorithm 4.2 (Modification of Algorithm 3.4) 



(II) Set n := 1 and s := back(iS). For j = 1, . . . , rrin do 
(a) Implement - 1; x„) := UnH - 1; Xn) 



(b) Compute tiAr(-; x„j^(j; a;„)), construct U]\i{-;Xn) according to (16) and compute 
Vn{ 



(c) If condition (17) holds: 

Update UN{-;Xn) := UAr(-;a;„). If j < m„ set S := {S,s + j), Xn+h ■= x^s^{j]Xn) 
and n = h + 1. 

(Ill) Set S := {S, s + nin), Xn+h '■= x^s^{mn] Xn), n := n + h and goto (I) 



Due to the principle of optimality, the value of VN-j{xuj^{j', Xn)) in (17) is known in advance 
from V]\f{xn). Hence, only UNi-', Xuj^iJ] Xn)) and V^^Xu^^mn', Xn)) have to be computed. This 
result has to be checked with VN{xn) for all j G {1, . . . ,m„ — 2}. Hence, the updating instant 
(T{n) has to be kept in mind. Note that a complete, yet more conservative update without the 



requirement of storing cr(n) can be performed using the method proposed in 21, Proposition 



4.3]. We also like to stress the fact that condition (17) allows for a less fast decrease of energy 
along the closed loop, i.e., the case V)v(x^jv("^n! ^n)) — VN{xuj^{mn',Xn)) is not excluded in 
general which is illustrated by the following example. 

Example 4.3 



Consider Example 3.1 and the initial values Xq = (0, 1)^ with prediction horizon A^ = 3. If we 
apply Algorithm [42[we obtain Vn{xo) = 5.109994744 and Vn{xi,^{2; x)) = 2.83461176 which 
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yields a = 0.5136. If Algorithm 3.4 is employed we obtain Vjy^Xuj^i^^x)) = 2.827656536 which 
implies a = 0.5144. Hence, Algorithm |4.2| accepts also deteriorations as long as the desired 



suboptimality degree is still maintained. 

Taking xq = (1,0)^ as initial value shows that the impact of Algorithm 4.2 may also improve 
these key figures: In this case Algorithm 4.2 provides Vj\f(x) = 4.08117251, VAr(x^^(2; x)) = 
0.96290399 with a = 0.7733 whereas Algorithm [0| gives us V]v(x„j^(2; x)) = 1.22718283 with 
a = 0.7470. For a further comparison we refer to the numerical experiments in the ensuing 
section. 



Next, a counterpart to Theorem 3.6 based on Algorithm |4.2 instead of Algortihm 3.4 is estab- 
lished. This results allows us to verify that modifying and, thus, robustifying the algorithm 
still leads to the desired stabihty and performance properties. To this end. Proposition |4.1| is 



applied iteratively to show asymptotically stable behavior of the state trajectory generated by 
Algorithm |3.4[ 



Theorem 4.4 

Let a control system ([T]) with initial value xq € X and a G 
Algorithm \3.4\ with the modification of Algorithm \4.2 



(0, 1] be given. Furthermore, suppose 
IS applied. Assume that the condition 
a > a in Step (Ic) of Algorithm 3.4 is satisfied for some j G {1, ... — 1} for each iterate 
n G Nq. Then the closed loop trajectory corresponding to the closed loop control ■) resulting 
from the used algorithm satisfies the performance estimate (15) from Proposition 3.3. If, 



addition, the conditions of Theorem 2.3 (ii) hold, then x 
asymptotically stable system. 



behaves like a trajectory of an 



Proof. The list S constructed by the algorithm contains all time instants at which the sequence 
of control values is updated by the MFC feedback law ■). Hence, different to the basic 



inequality (14) holds. 



Algorithm |3.4| the list S may not only contain time instants at which the relaxed Lyapunov 

(T('T-))neNo constructed analogously to S from Algorithms 
the modified Step (II) ol 
1}, Inequality (|I4)) with x 



3.4 



4.2 



Consider a second hst S 
but which is not updated in the modified Step (II) of Algorithm |4.2[ Since Step (Ic) is satisfied 
for some j G {1, 



[T{n)) and 



1 



Tin 



is satisfied. Using Froposition 4.1 ensures that this inequality is maintained despite of updates 



carried out by Algorithm 4.2 Hence, by Froposition 3.3, the performance estimate (|15|) follows 
for the control sequence •)• Again, similar to the proof of Theorem 2.3 



given m 



12 



Theorem 7.6], standard direct Lyapunov techniques can be used to show asymptotically stable 
behavior of the closed loop x^s (■). □ 



An important conclusion of Froposition 4.1 and Theorem 4.4 is the following 



X. 



Corollary 4.5 

Consider the open loop system x„^(-; ■) of (|6|, the feedback law /ifr(-; ■); ^he closed loop trajectory 
n G Nq; of (11) with initial value Xq G X and a fixed a G (0,1] to be given. Moreover, 
suppose inequality (14) to hold foruN{-] Xn), a G (0, 1], and ntn G {2, . . . , — 1}. // (17) holds 
for all j G {1, . . . , rrin — 1} and all n G No, then ^ holds, that is the standard MPC feedback 
tiN^-) can be applied. If, in addition, the conditions of Theorem 2.3(ii) hold, then x^^{-) behaves 
like a trajectory of an asymptotically stable system. 



Proof. Follows directly from Theorem 4.4 



□ 



The following example illustrates the impact of the modified algorithm, but it also indicates 
that the lower bound may still not be tight. 
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Example 4.6 



In Example 3/7 stability can be shown if one allows for implementing more than than one 
element of the open loop sequence of control values, i.e., m„ > 1. Now we use Algorithm 
4.2 instead of Algorithm 3^, i.e. we allow for longer control horizons m„ and try to verify 
whether the control loop can be closed more often. And indeed, we obtain a > a = for 
each initial value Xq G A" with m„ = 1, that is we can show stability for standard MPC 



according to Corollary |4.5[ Moreover, taking Example |3.7| into account, this assertion holds for 
all X e X = 

Yet, if the suboptimality bound is slightly increased to a = 0.01, the condition a > a in Step 
(Ic) is not satisfied for any j G {1, . . . , — 1} for trajectories emanating from a subset of X, 
cf . Figure |3| 



Figure 3: Illustration of initial values (grey) where a < in dsl) (^ir) whereas a < a = 0.01 in 



Step (Ic) of Algorithm 3.4 together with Algorithm 4.2 for at least one iterate n 



Intuitively, one could guess that the performance of the Riccati based feedback law in our 
example is better than a = 0.01. In the following section, we address this issue indirectly as 



an outcome of an exit strategy required in Step (Id) of Algorithm 3.4, see also Remark 3.5 



5 Acceptable violations of relaxed Lyapunov inequality 

Until now we have supposed that our relaxed Lyapunov Inequality (|T4j), i.e.. 



Pi = Pi{a) ■■= VN{xi) - VN{xur,{mi] Xi)) - a ^ ^{xuf,{k] Xi),UN{k; Xi)) > 0, (18) 



fc=0 



is satisfied for some mi G {1,2,...,A^ — 1}. It is well known that (14) and, thus, pi > 
holds for sufficiently long horizon A^, see 



16 . However, this is not necessarily true 



for short horizons A^ - even if the closed loop shows asymptotically stable behavior. Our basic 



Algorithm 3.4 allows us to cope with such a case via an exit strategy in Step (Id). As outlined 



in Remark 3^, one possible way to deal with this issue is to sufficiently increase the length of 
the horizon. Due to the corresponding high computational costs, however, we want to avoid 
such an approach. Here, we first present the following result which uses an idea similar to the 
watchdog technique in nonlinear optimization, see [6]. Lateron we will show how this result 
can be utilized in order to modify our algorithm. 
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Theorem 5.1 

Let a list S and, thus, a sequence (mi)i6N(, C {1, 2, . . . , — 1} be given. For optimization 
horizon N and initial state xq e X, the feedback law ■) generates the closed loop trajectory 
(xj)igNo according to (11). Consider the open loop system ■) of a G (0, 1], and the 

sequence (pi)ieNo from (18) to be given. Furthermore, suppose there exist ai{-), a2{-) G /Cqo such 
that a2{\\xi\\x*) > V^ixi) and i*{xi) > ai{\\xi\\x*) hold for all Xi, i G Nq. Then, convergence of 
(sn)neNo with Sn Yll=(d Pi ^.nsures the convergence Xi — )■ fori tending to infinity, i.e., x^s^(-) 
behaves like a trajectory of an asymptotically stable system. Furthermore, we have VN^Xi) 
for i approaching infinity. 

Proof. Suppose that the sequence (VAr(xi))igNo is unbounded. Furthermore, let £ > be given. 
Since {sn)neNo converges, (pi)jeNo is a Cauchy sequence. Hence, there exists M = M{e) G Nq 
such that \pn—Pm\ < £ for all n,m > M with n,m E Nq. Since (VAr(xj))jgNo C Mq is unbounded, 
there exists an index k > M, k G No, such that Vn{xn) + £ < VAr(xfc). This leads to 

k—l m;— 1 

£>\pN-pk-i\= Vn{xm) -VN{xk) -a^ ^(a;«jv(i;2;i),MAr(j;Xi)) 

i=M j=0 

k—l nii — l 

= V)v(Xfc) - Vn{xm) X] ^i^UNij'^^i)^^Nij;Xi)) 

i=M j=0 

> VN{xk) - Vn{xm) > £, 

where we used positive semidefiniteness of £(■, ■) in the second last inequality. This inequality 
contradicts our unboundedness assumption of (VAr(xj))jgNo? hence (VAr(xj))jgNo is bounded. 
Now let M := supjgp^Q V]sr[xi) < oo. Furthermore, let £ > be given. Since {sn)n(^'No converges, 
there exists M = M{e) G No such that \pi\ < e for all i > M. Suppose that the sequence (xj)igNo 
is unbounded. Then, there exists Xi with i > M, i G No, such that i*{xi) = min„g[/ ^(xj, m) > 
ttidla^ilU*) > {M + 6)/a. Hence, we have 

rrii-l 

£ > \pi\ > « X] i{xuj^{k;Xi),UN{k;Xi)) + VW(a;„j^(mi; x^)) - VV(a;i) 

fc=0 

> at{xi) - VN{xi) > a{M + e)/a - M = e, 

contradicting the unboundedness assumption of (xj)jgNo- Consequently, (xj)jgNo is bounded. 
The deduced boundedness of (a;^)^^^^ and ensures that (xj, y(a;j))jgN„ C X x Mq 

contains a convergent subsequence (xj^. , with ij < ij^i for all j G No. Now suppose 

that 7^ X* for j tending to infinity and define e := aa;i(||a;||^*)/4. Since VAr(a;i^)jgN and 

(sn)neNo converge, there exists M = M{e) G No such that 

\VN{xi^) -VN{xi^^J\ < e and \pi,\ < £ and HxiJI^;* > ||x||3;*/2 

hold for all j G Nq. As a consequence, we obtain 

nii —1 



|Pi,|>a 2^ iixuj^ik;Xi^),UNik;Xy)) + VNixuj^imi^;Xy)) -VNixi^) 

k=0 

> at{xi.) -e> 11^.*) - e> aai(||x||^*)/2 - £ = e, 



which contradicts \pi\ < e. Hence, (xj.) — )■ x* for j approaching infinity. Using Vn{x.i) < 
ct2{\\xi\\x*), i G No, ensures V{xi.) — )■ for j — )■ oo. Let £ > be given and N = N{e) G N be 
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chosen such that |p„ — Pm\ < ai{e)/2 for all n, m > A^, G N, m G N. Choosing j = j{e) G N, 
j > N such that VN{xi^ < ai{e)/2 yields 



i m„ — l 



Xi) + a ^ ^ ^ ^ i{xuf^{k', Xn), Uj\f{k; — Vj\f{xi^) + V]v(xj 



i m„ — 1 



< \VNixi^) - VN{xi) - " ^{xuNik; Xn),UN{k;Xn))\ + ai{e)/2 

n=ij k=0 

< \Pij - Pi\ + «i(£^)/2 < (yi{e) 

and, thus, < e holds for all i > ij. Since e may be chosen arbitrarily we obtain Xj — )■ x* 

which ensures asymptotic stability of Xi with i G Nq. Repeating the argument used in order to 
show that VN{xi.) — )■ for j tending to infinity for {xi)i^no completes the proof. □ 



Theorem |5.1| ensures asymptotic stability but does not guarantee the desired performance 
specification. We like to point out that pi, i G No, does not have to be positive. Indeed, even 
Vjv(-) may increase along the closed loop trajectory before it finally converges to zero. 
Analyzing the sequence (s,i)„gNo and its limit more carefully leads to the following corol- 
lary which allows us to generalize Algorithm 3.4 by incorporating knowledge of the sequence 

(•Sn)neNo- 

Corollary 5.2 

Let a G (0, 1] be given and s„ > hold for some ri G No with Sn from Theorem 5.1. Then, the 
dosed loop trajectory x^s^{-) satisfies the relaxed Lyapunov inequality at Xn+i, i-e., 



n mi — l 



VN{Xn+l)+Oi^ ^ i{Xuj,{k\Xi),UN{k\Xi)) <Vn{xo). 



(19) 



j=0 k=0 



//lim„_>ooS„ > holds, then the suboptimality Estimate (15) is satisfied. 



Proof. We obtain the stated relaxed Lyapunov inequality directly by inserting the definition 
of Pi from (18) into s„ and using the equivalence of the open and closed loop control un{-', Xi), 
/^Ar('i^i) from (12) which allows us to replace x^j^ij^i^Xi ) by Xj+i. In order to show (US^ we 

have to establish aVi^^(xo) < Vn{xq). To this end, we define 

n rrii — l 

Sn ■= Vn{xo) - ^ £{xuj^ik;Xi),UNik;Xi)) = s„ + VAr(a;„^(mj; Xj)). 

1=0 k=0 

The fact that the range of V/v(-) and £(•, u) is contained in Mg for all -u G f/ ensures that (s„)„gNo 
is a monotonically decreasing sequence which satisfies s„ > s„ for all n G No. Since s„ < 
for n G No contradicts the positivity of lim„_s.oo Sn, we conclude > for all n G No. Hence, 
(sn)neNo is monotouically decreasing and bounded and, thus, converges to 5 > lim„_s.oo Sn > 0. 
This yields 

n rrii—l ^ 

Vn{xo) > lim ^ i{xuj^{k; Xi),UN{k; Xi)) = aV^''{xo), 

i.e., the desired assertion. 



1=0 fc=0 



□ 
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The key idea of Corollary 5.2 in comparison to Theorem |2.3Ki) and Proposition 3.3 is to allow 



for intermediate increases within Iph or (14) for certain time instants n which corresponds to 



Pn < 0. Such a behavior is typical if the system is not minimal phase with respect to the cost 
functional and has to be accounted for if the cost functional cannot be adapted appropriately. 
We point out that the conditions of Theorem |5.1| and Corollary |5.2| with respect to lim„_^oo Sn, 



unlike conditions (pi) or ( 14), cannot be checked at runtime. Still, while the maximal a satisfying 



([8j) or (14) is locally a lower bound on the degree of suboptimality, cf. (14), the knowledge of 
Sn allows us to compute such a bound for a horizon of length ^"=o^»' i^^^- Note that the 
latter bound uses the stage costs as weighting factors. 



Corollary 5.3 

Consider a feedback law fif^{ 
well as the corresponding closed loop trajectory Xn, n G Ng, of (11) with initial value Xq to 
be given. Furthermore, suppose a G (0, 1] to be fixed and {sn)n<^No be defined as in Theorem 



with sequence {mn)n&n> £ {1? 2, . . . , A^ — 1} for alln as 



(5.1). Then we have the following: 



{a I a maximally satisfies (19)} 



Vn{xo) - VN{Xn 



+1. 



Vn{,Xo) - Vn^Xu+i) - Sn 



a. 



(20) 



9, then (15) holds with degree of suboptimality 



(a) If (sn)neNo converges with lim 
a = aVN{xo)/iVN{xo) - 6). 

Proof. To prove (i) we first reformulate the definition of Sn to obtain 

Vn{xq) - VN{Xn+l) - Sn 



n rrii—l 



i{xuj^{k;Xi),UNik;Xi)) 



i=0 k=0 



a 



(21) 



where we used the equivalence of the open and closed loop control uj\f{-;Xi), /ifr(-; Xi) from (12) 
to replace Xuj^imi] Xi) by Xj+i for z = 0, . . . , ra. To obtain {a \ a maximally satisfies (19)} we 



consider \\19l to hold as an equality and solve for a. Now, we can use (21) to substitute the 



resulting denominator which gives us ( 20 ) . 



In order to obtain assertion (ii), we use the following fact shown in the proof of Theorem 5.1 



If ('Sn)nGNo couvcrges, then VAr(xj) — )■ as z — )■ oo. Hence, taking n to infinity in (20) shows 
assertion (ii). □ 



Corollaries 5.2 and 5.3 give rise to a possible exit strategy in Step (Id): If s„ remains positive. 



then asymptotic stability and the desired performance bound can be guaranteed if the control 



loop is closed. One possible implementation of an algorithm based on Corollary 5^ is the 
following: 



Algorithm 5.4 (Extension of Algorithm 3.4) 

Given state Xq := x, n = 0, list S = (n), N G N>2, a G (0, 1] and Sr 
(I) Set j := and compute MAr(-;x„) and Vn^x). Do 
(a) Set j := j + 1 and compute VAr(x„^(j; a;„)) 



(b) Compute from (18) and set p„(j) := Pn 
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(c) If Pn{j) ^ holds: Set = j and goto (II) 






\ ^' AT 1 . 

(d) It J = iV - 1: 






If Sn + maxjg{i_ 7V-1} Pn{j) < 0: Print warning, set m„ := 1 and j 
Else: Set m„ := min{j* | j* = argmin^gii^ -p„(j)}, p„ : = 
goto (II) 


^oto (II) 

-- pnirun) and 




while Pri < 




(11) 


For j = 1, . . . , m„ do 

Implement fi%{j - 1; x„) := MAr(j - 1; x„) 




(III) 


Set 5 := (>S,back(5) +m„), s„+i := s„ + p„(m„), x^+i := x^s(m„;x„), n 
goto (I) 


:= n + 1 and 



If Sn+i = Sn + maXjg{i^...^Ar_i} p„(j) < the performance Estimate (15) cannot be guaranteed. 
Hence, even if lim„_5.oo s„ > holds, the exit strategy cannot be used since this knowledge is 



not at hand at time instant n. Furthermore, we like to point out that Algorithm |5.4| coincides 
with Algorithm 3.4 for n = 0, i.e., the first MFC step. Note that this is the only time instant 
at which we may increase the optimization horizon N such that the presented stability proofs 
still hold. Hence, we can repeat Step (I) of the algorithm in order to ensure the desired relaxed 
Lyapunov inequality. In contrast to that, reducing may be done at runtime of the proposed 
algorithms. 

Remark 5.5 

Using the definition of pi in (18) and Vn{x) > Vjy^kix) for k E {0, . . . , N — 1} we obtain 



-1 

k=0 



Pi := VN{xi) - VN~kixuN{mi;Xi)) - a i{xur^{k; Xi),UN{k; Xi)) > pi{a) = p. 



for k E {0, . . . , N —1}. Hence, the telescope sum argument used in the proofs of Proposition 3.3 



and Theorem 5.1 still holds if the horizon length N decreases monotonically along the closed 
loop. 



Remark 5^ gives rise to the following strategy: First, a large optimization horizon is chosen 
to avoid the startup problem. Then, the horizon can be reduced gradually along the closed 



loop provided the relaxed Lyapunov inequality ( 14 ) holds for the reduced horizon for all future 



time instants. In context of Theorem 5.1 one has to guarantee that s„ stays positive along the 
closed loop in order to reduce the optimization horizon. Note that an algorithm based on a 
quantity representing slack from proceeding steps has also been designed in j8| Theorem 1] in 
the context of varying optimization horizons. 

Remark 5.6 

For a > an additional exit strategy is the following: if < a < a holds for suboptimality 



index a computed from Corollary 5.3(i), then stability or a certain performance bound may 



still be ensured. Additionally, Corollary 5^ allows for checking whether the originally desired 
suboptimality estimate a>a is guaranteed again at a later time instant. 

In order the check whether the control loop can be closed more often without loosing stability or 



violating the lower bound on the degree of suboptimality a. Corollaries 5.2 and 5.3 are employed 
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directly. Note that this is possible since the sequence (s„)„gNo gives us an absolute value with 
respect to the desired decrease along the closed loop - contrary to Propositions 3^ and 4A_ 
which have to be interpreted in terms of the stage costs One possible implementation of 

the update check is the following: 



Algorithm 5.7 (Modification of Algorithm 5.4) 



(II) Set n := 1 and s := back(iS). For j = 1, . . . , m„ do 
(a) Implement fi%{j - 1; a;„) := u^U - 1; 



(b) Compute U]si{-; Xu^{j', Xn)), construct Uj^{-;Xn) according to (16) and compute 



c) Compute p„ from (18) with ujv(-;a;„) replaced by U]\f{-; x{n)) and set Pn{j) '■= pn 



(d) If + maXje{i,...,Ar„i} p„(j) > holds: 

Update un{-; Xn) := un{-; a;„). 

If j < m„: Set S := {S, s + j), Sn+h ■= Sn+n-i + Pn{j), Xn+n ■= x^s^ij] x„) and 
n = n + 1. 

(Ill) Set S := {S,s + rrin), Sn+h ■= Sn+n-i + Pn{mn), Xn+h ■= X ^3^(171^, Xn) , Ti := u + fi and 
goto (I) 



Example 5.8 

Consider Example 3.1 one last time. As we have seen in Example 3.7 and 4.6 stability of the 



closed loop can be shown for initial values xq E X hj means of Propositon |3 .3| for m„ > 1 and by 
Corollary 4^ for m„ = 1. In Example |4.6 it has also been shown that one cannot guarantee the 
lower performance bound a = 0.01 by Theorem 4.4 Yet, we would expect a better performance 
of the Riccati based feedback law. And indeed, using Algorithm 5.7 together with Corollary 
5.3| we obtain a = 0.52307 for standard MPC (m„ = 1) with horizon length A = 3. In Figure 
4] the values of a resulting from Proposition 3^ and Corollary 5^ are displayed for different 
horizons N showing the improvement of Corollary 5.3 For reasons of completeness, we also 



displayed the performance results from 20, Section 6]. Note that while the latter hold for the 
entire state space X = M^, our results are only exact up to discretization accuracy. Still, the 
improvement of suboptimality bounds is significant and allows us to reduce the optimization 
horizon from A = 5 as shown in 20 to A = 3. 

s„+maxjg|i . 7V-1} Pn(j) < along the closed loop for exactly 



5 as shown in 
Unfortunately, we observe Sn+i = 
the same initial values for which condition a > a in Step (Ic) of Algorithm 3.4 is not satisfied 
for at least one iterate n, cf. Figure |4](right). 



Remark 5.9 

The proposed algorithms and theoretic results in this paper are designed in a trajectory based 
manner. Therefore, these methods are not suited in order to ensure asymptotic stability or a 
desired suboptimality degree for a set of initial values A" C X — in contrast to the approaches 



presented in 11,20L Yet, incorporating the conditions presented in Proposition 4-1 or Corollary 



5.2 in the methodology proposed in ii, Section 4j is possible. This topic will be subject to future 
research. 
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• t 8 



Figure 4: Left: Illustration suboptimality bounds from 20, Section 6] (o), a from Proposition 
3.3 {-k) and a from Corollary 5.2 (•) for increasing horizon length A^. Right: Illustration of 
initial values for which = s„ + maxjgji Ar_i} p„(j) < holds for at least one iterate n (•) 



6 Conclusions 

An algorithm based approach has been presented which ensures stability of the MFC closed 
loop without terminal constraints or costs. In particular, the proposed methodology allows 
for deducing stability and performance bounds for comparatively small optimization horizons. 
These results are based on structural properties of a relaxed Lyapunov inequality for the open 
loop which are computed a priori. In order to robustify the outcome of the corresponding 
algorithm, conditions which guarantee maintaining of this Lyapunov inequality despite closing 
the control loop at additional time instants have been derived. Last, a further improvement 
has been achieved by incorporating an accumulated quantity in the presented algorithms which 
reflects previous decrease in terms of the value function of the MFC problem. Doing so yields 
an exit strategy which often resolves problems occuring within our basic algorithms if the 
optimization horizon is chosen to small. Furthermore, enhanced performance estimates are 
obtained. 
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